Tuning HeidelTime for identifying time expressions in clinical texts in English and French
Abstract
We present work on tuning the HeidelTime system to identify time expressions in English and French clinical texts. Most of the method consists of enriching and adapting linguistic resources so that TIMEX3 expressions in clinical texts can be identified and normalized. The adapted versions were tested on the i2b2/VA 2012 corpus for English and, for French, on a collection of clinical texts annotated for the purpose of this study. We achieve an F-measure of 0.8500 on the recognition and normalization of temporal expressions in English, and up to 0.9431 in French. Future work should improve and consolidate these results.
Similar resources
French Resources for Extraction and Normalization of Temporal Expressions with HeidelTime
In this paper, we describe the development of French resources for the extraction and normalization of temporal expressions with HeidelTime, an open-source multilingual, cross-domain temporal tagger. HeidelTime extracts temporal expressions from documents and normalizes them according to the TIMEX3 annotation standard. Several types of temporal expressions are extracted: dates, times, durations ...
Chinese Temporal Tagging with HeidelTime
Temporal information is important for many NLP tasks, and there has been extensive research on temporal tagging with a particular focus on English texts. Recently, other languages have also been addressed, e.g., HeidelTime was extended to process eight languages. Chinese temporal tagging has received less attention, and no Chinese temporal tagger is publicly available. In this paper, we address...
HeidelTime: Tuning English and Developing Spanish Resources for TempEval-3
In this paper, we describe our participation in the TempEval-3 challenge. With our multilingual temporal tagger HeidelTime, we addressed task A, the extraction and normalization of temporal expressions for English and Spanish. Exploiting HeidelTime’s strict separation between source code and language-dependent parts, we tuned HeidelTime’s existing English resources and developed new Spanish reso...
A Comparative Study of Nominalization in an English Applied Linguistics Textbook and its Persian Translation
Among the linguistic resources for creating grammatical metaphor, nominalization rewords processes and properties metaphorically as nouns within the experiential metafunction of language. Following Halliday's (1998a) classification of grammatical metaphor, the current study investigated nominalization exploited in an English applied linguistics textbook and its corresponding Persian translati...
(Un)Translatability of Persian Idiomatic Expressions to English in Political Discourse
The present study sought to investigate the extent to which Persian idiomatic expressions would influence the western translators' strategies in providing the ultimate product in English, and it also attempted to uncover the underlying assumptions in target text, then to suggest some weighty strategies to overcome difficulties with translation. For this purpose, the data was analyzed within the...